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Members of the phylum Bacteroidetes are abundant in many marine ecosystems and are known to 
have a pivotal role in the mineralization of complex organic substrates such as polysaccharides and 
proteins. We studied the decomposition of the algal glycans laminarin and alginate by 'Gramella 
forsetiF KT0803, a bacteroidetal isolate from North Sea surface waters. A combined application of 
isotope labeling, subcellular protein fractionation and quantitative proteomics revealed two large 
polysaccharide utilization loci (PULs) that were specifically induced, one by alginate and the other 
by laminarin. These regulons comprised genes of surface-exposed proteins such as oligomer 
transporters, substrate-binding proteins, carbohydrate-active enzymes and hypothetical proteins. 
Besides, several glycan-specific TonB-dependent receptors and SusD-like substrate-binding 
proteins were expressed also in the absence of polysaccharide substrates, suggesting an 
anticipatory sensing function. Genes for the utilization of the beta-1,3-glucan laminarin were found 
to be co-regulated with genes for glucose and alpha-1,4-glucan utilization, which was not the case 
for the non-glucan alginate. Strong syntenies of the PULs of G. forsetiF with similar loci in other 
Bacteroidetes indicate that the specific response mechanisms of G. forsetiF to changes in 
polysaccharide availability likely apply to other Bacteroidetes. Our results can thus contribute to an 
improved understanding of the ecological niches of marine Bacteroidetes and their roles in the 
polysaccharide decomposition part of carbon cycling in marine ecosystems. 
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Introduction 

Bacteroidetes represent important contributors to the 
mineralization of complex organic substrates such as 
proteins and polysaccharides in the marine realm 
(Kirchman 2002; Bauer et al., 2006; Gonzalez et al, 
2008; Teeling et al., 2012; Fernandez-Gomez et al., 
2013). An important source of such organic matter is 
the marine phytoplankton that has been estimated to 
be responsible for about 50% of the global net primary 
production (Field et al, 1998). Polysaccharides 
constitute a substantial fraction of this primary 
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production, and algae can be composed of up to 
50% or more of polysaccharides (Kraan, 2012). 
Marine Bacteroidetes are assumed to have a key role 
in the utilization of these algal polysaccharides 
(Kirchman, 2002; Pinhassi et al., 2004; Bauer et al, 
2006; Gonzalez et al., 2008; Teeling et al., 2012; 
Fernandez-Gomez et al., 2013), but only for few 
cultivable members this has been investigated in 
functional studies so far (Thomas et al., 2012). 
Recently, it could be shown that during a North Sea 
diatom bloom specialized populations of Bacteroi- 
detes coacted in the successive decomposition of 
algal-derived organic matter (Teeling et al., 2012). 
This supports the assumption that the wealth of 
naturally occurring algal polysaccharide structures 
and compositions provides distinct ecological niches 
for heterotrophic marine bacteria (Grossart et al., 
2005; Hehemann et al., 2010; Gomez-Pereira et al., 
2010; Gomez-Pereira et al., 2012). However, so far 
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respective niche specializations of different members 
of the Bacteroidetes are only poorly resolved. In order 
to gain deeper insights into the relationship between 
abundant algal polysaccharides and the gene reper- 
toires of adapted Bacteroidetes, we studied the isolate 
'Gramella forsetif KT0803 (Eilers et al, 2001). This 
strain belongs to the class Flavobacteria of the 
phylum Bacteroidetes and has been isolated from 
eutrophic coastal surface waters near the island 
Helgoland in the German Bight (54°11'03"N, 
7°54 / 00 // E). A previous whole-genome analysis 
(Bauer et al., 2006) of 'G. forsetif has revealed various 
genetic modules encoding carbohydrate-active 
enzymes (CAZymes) and transporters, frequently 
arranged in so-called polysaccharide utilization loci 
(PULs; Sonnenburg et al., 2010). Such PULs have also 
been reported for commensal Bacteroidetes from the 
human intestine, and are specifically induced by 
oligosaccharides (Martens et al., 2011), including 
such of algal origin (Hehemann et al., 2012). In this 
proteogenomic study, 'G. forsetif was studied during 
growth on two typical algal glycans, laminarin and 
alginate, in comparison to a glucose-supplemented 
control. Beta-l,3-glucans of the laminarin type act as a 
storage polysaccharide in brown algae, Chryso- 
phyceae, diatoms and haptophytes such as Phaeo- 
cystis spp. where they can make up about 30-50% of 
the cell dry weight (Beattie et al., 1961; Janse et al., 
1996; Davis et al, 2003). Alginate on the other hand is 
a cell wall polysaccharide as well as an intracellular 
component in brown algae that can account for up to 
40% of algal dry weight (Haug et al., 1966; Davis 
et al., 2003). It is an unbranched copolymer composed 
of 1,4-linked beta-D-mannuronate (M) and alpha-L- 
guluronate (G) that are arranged in alternating 
M-blocks, G-blocks and mixed M/G-blocks. Relevant 
proteins for the decomposition of these polysacchar- 
ides were determined by stable isotope labeling, 
followed by subcellular fractionation and high-accu- 
racy mass spectrometry. This combinatory approach 
allowed for an enhanced precision in protein quanti- 
fication and revealed two major PULs of 'G. forsetif 
that were specifically controlled by either alginate or 
laminarin. Data of this study furthermore indicate a 
cross-regulation of genes acting on similar polysac- 
charides such as laminarin and 1,4-glucans and 
revealed that some proteins so far designated as 
'hypothetical' might contribute to polysaccharide 
decomposition. Particular progress was achieved with 
respect to possible substrate specificities within the 
highly variable group of TonB -dependent receptors 
(TBDRs), which are known to be of crucial impor- 
tance for uptake of glycans by marine Gram-negative 
bacteria (Tang et al., 2012). 



Materials and methods 

Culture conditions and metabolic labeling 

'G. forsetif KT0803 was grown at 25 °C and 

170r.p.m. in an artificial seawater medium, as 
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described by Schut et al. (1993), in batches of 
100 ml in 500 ml shaking flasks until the 
mid-exponential phase (OD 600 x0.5). The medium 
was supplemented with either 14 N naturally 
labeled or 15 N-labeled ammonium chloride 
('isotopic tracer', 99%; Euriso-Top, Saint-Aubin 
Cedex, France). As algae-specific carbon sources, 
laminarin (Sigma-Aldrich, Munich, Germany; 
source: brown alga Laminaria digitata; composi- 
tion: about 25 glucose units) and sodium alginate 
(Sigma-Aldrich; source: brown algae; composition: 
61% mannuronic acid, 39% guluronic acid with 
60-400 uronic acid units) were added to a final 
concentration of 0.2%. The control cultures were 
supplemented with glucose (Carl Roth GmbH & Co. 
KG, Karlsruhe, Germany) to a final concentration 
of 0.2%. All of these experiments were carried out 
in triplicates (Supplementary Figure SlA). 



Proteome fractionation 

Cellular protein extracts were separated into three 
different fractions: the intracellular soluble, the 
membrane-bound and the extracellular soluble 
proteome (Supplementary Figure SlB). For purifica- 
tion of extracellular proteins, cells were centrifuged 
(6600 g at 4°C for lOmin) and the resulting super- 
natant was precipitated at 4 °C using 10% trichlor- 
oacetic acid, according to the protocol published by 
Antelmann et al. (2001). The precipitated proteins 
were pelleted via centrifugation (40 000 g at 4°C for 
60min), washed with ethanol and solubilized in 8 m 
urea and 2 m thiourea. To extract the membrane- 
bound and intracellular protein fractions, the cells 
were disrupted in a French press (Simaminco SLM, 
Rochester, NY, USA; three cycles) followed by 
removal of cell debris by centrifugation (40 000 g at 
4 °C for 20min). Subsequent ultracentrifugation 
(100 000 g at 4 °C for 60min) allowed for separation 
of intracellular soluble and membrane-bound pro- 
teins. Purification of the membrane fraction was 
performed, as described by Eymann et al. (2004), 
omitting the n-dodecyl-(3-D-maltoside treatment. 
Accordingly, pelleted bacterial membranes were 
homogenized, washed for 30min with a highly 
concentrated saline solution (20 mM Tris-HCl pH 
7.5,10 mM EDTA and 1 m NaCl) and ultracentrifuged 
(100000 g at 4°C for 60min). Following this, the 
same steps were carried out using a carbonate buffer 
(100 mM Na 2 C0 3 -HCl pH 11, 10 mM EDTA and 
100 mM NaCl). Finally, purified membrane and 
intracellular proteins were solubilized in 50 mM 
triethylammonium bicarbonate buffer (Sigma- 
Aldrich) supplemented with the Complete Protease 
Inhibitor (Roche, Berlin, Germany). After determi- 
nation of the protein concentration, triplicates of 
15 N-labeled protein extracts from all three different 
subcellular fractions were combined and then 
mixed with equal amounts of individual 14 N-labeled 
protein samples in a 1:1 ratio for all substrates before 
proteome analysis. 
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USA- 
ID-PAGE and protein measurements 
From each 14 N-labeled triplicate combined with the 
15 N-labeled pooled protein extracts, 15|ig of each 
fraction were separated by lD-polyacrylamide gel 
electrophoresis (10% acrylamide, protein ladder by 
Thermo Fisher Scientific Inc., Waltham, MA, USA), 
stained with Coomassie G-250 'blue silver' 
(Candiano et aL, 2004), eluted from the gel and 
digested using trypsin. Peptides were subsequently 
desalted by ZipTip columns (Millipore, Billerica, 
MA, USA) according to the manufacturer's guide- 
lines. For liquid chromatography-MS/MS measure- 
ments, peptides were subjected to a reversed-phase 
C18 column chromatography operated on a nanoAC- 
QUITY-UPLC (Waters Corporation, Milford, MA, 
USA) and separated as described by Otto et ah 
(2010). Mass spectrometry (MS) and MS/MS data 
were determined using an online-coupled LTQ- 
Orbitrap mass spectrometer (Thermo Fisher Scien- 
tific Inc.). 



Data processing 

The *.dta files were extracted from *.raw files with 
the BioworksBrowser 3.3.1 SPl (Thermo Fisher 
Scientific Inc.) with no charge state deconvolution 
and deisotoping. The data were searched by 
SEQUEST version v28 (rev. 12, Thermo Fisher 
Scientific Inc.) against a 'G. forsetir target-decoy 
protein sequence database. A set of common 
laboratory contaminants was included and all data 
were compiled using the BioworksBrowser. The 
following search parameters were used: enzyme 
type, trypsin (KR); peptide tolerance, 10 p. p.m.; 
tolerance for fragment ions, la.m.u.; b- and y-ion 
series; variable modification, methionine (15.99 Da). 
A maximum of three modifications per peptide was 
allowed. In the second iteration, the mass shift of all 
amino acids completely labeled with 15 N-nitrogen 
was taken into account in the search parameters. 
Resulting *.dta and *.out files were assembled and 
filtered using DTASelect (version 2.0.25; para- 
meters: -y 2 -c 2 -C 4 -here -decoy Reverse_ -p 2 -t 
2 -u -MC 2 -i 0.3 -fp 0.005). 



Relative protein quantification and analysis of variance 
Relative quantification of the shotgun proteomics 
data was carried out according to the principle of 
pooled reference samples published by MacCoss 
et aL (2003) using the Census software (Park et aL, 
2008) as described in Otto et al. (2010). Basically, 
according to the filtered database search peptide 
identification data provided by DtaSelect, 
both intensities for 14 N- (sample) and 15 N- (pooled 
reference) mass traces were extracted for each 
peptide identification in the respective mass 
spectrometric data and were compared within 
Census to yield a ratio between the sample mass 
trace ( 14 N) and the pooled reference mass trace ( 15 N). 
These peptide ratios were then combined to yield 



sample/reference ratio values on the protein level. 
Finally, peptide ratios and combined protein ratios 
were exported (i? 2 -values >0.7 and only unique 
peptides; proteins failing to be relatively quantified 
were checked manually in the graphical user inter- 
face for on/off proteins). Only proteins identified 
and quantified with at least two peptides were taken 
into account for further analyses. For correcting any 
potential systematic bias that might have been 
introduced by pooling of the 15 N-labeled samples 
and mixing with unlabeled protein samples, the 
quantitative data of a complete GeLC-MS run were 
median-centered. For all ratios, log 2 -values and 
averages of the biological replicates were deter- 
mined. TMEV (version 4.4; Saeed et aL, 2003) was 
applied to carry out the analysis of variance with the 
following settings: number of groups 5; P- values 
based on permutation (1000) resulting in P-values 
<0.01 as significance threshold. Quantification 
and analysis of variance data are provided in 
Supplementary Table Si. 



Gene functions and protein localization 
Pairwise identities of homologous genes were 
determined by means of the Geneious alignment 
software (Drummond et aL, 2010). Details on protein 
functions were obtained from precomputed data 
available on the UniProt Knowledgebase (Magrane, 
UniProt consortium 2011), the ENZYME database 
(Bairoch, 2000) and by using the JCoast software 
(Richter et aL, 2008). Information on the classifica- 
tion of CAZymes was derived from the CAZy 
database (Cantarel et aL, 2009). Subcellular localiza- 
tions of all proteins were predicted according to 
Romine (2011); the data are provided in 
Supplementary Table S2. 



Results and Discussion 

Proteome analysis 

In the present proteome analysis, 1691 proteins 
were identified, which corresponds to 47% of all 
protein-coding genes in 'G. forsetir . A considerable 
proportion (12%) of identified proteins could be 
assigned to the outer membrane, with one-third 
showing significant upregulation in alginate- or 
laminarin-grown cultures. Proteins enriched in the 
membrane fraction most frequently consisted 
of transporters (12%, mainly TBDRs), CAZymes 
(5%, primarily glycosyltransferases, GTs), signal- 
transducing components (3%), SusD-like proteins 
(2%) and proteins with unknown functions (35%). 

A genome-wide in silico localization prediction 
indicated that the 'G. forsetir extracellular proteins 
are in most cases attached to its outer membrane 
(Supplementary Table S2). This was supported 
by broad overlaps of proteins detected in the 
extracellular and the membrane proteome 
(Supplementary Figures S2A and S3). The exposure 
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of proteins to the extracellular space is essential for 
bacteria in order to efficiently bind macromolecular 
substrates and decompose them into smaller com- 
ponents that are suitable for uptake into the cell. 
Conceivably, it would be an economical advantage 
for marine microorganisms to keep the majority of 
substrate-binding proteins and hydrolytic enzymes 
to themselves by attaching them to the cell envelope 
rather than secreting them into the surrounding 
water. Such a strategy would prevent a loss of these 
proteins by diffusion and increase the efficiency of 
breakdown/uptake coupling (see also Azam and 
Malfatti, 2007). Intriguingly, at the same time, such 
membrane-anchored enzyme systems would be a 
means of cell attachment to nutrient-rich particles 
like marine snow. Indeed, about 74% of all secreted 
proteins found in the extracellular fraction of 'G. 
forsetii' were anchored to the outer membrane and 
thus associated with the cell (Supplementary 
Figures S2 and S3, Supplementary Table S3). 

The proteome analysis revealed that both lami- 
narin and alginate notably affected the protein 
expression patterns of the membrane and extra- 
cellular fractions, whereas distinct quantitative 
changes of proteins from the intracellular fraction 
were observed in glucose-grown cultures 
(Supplementary Figure S2B). In total, a differential 
expression of 703 proteins (20% of all annotated 
genes) was observed. Seven outer membrane proteins 
showed a particular strong increase in abundance. 
These proteins are encoded in two large alginate- and 
laminarin-dependent PULs (Supplementary Figures 
S4A and B). The membrane fraction from glucose- 
grown cells was mainly dominated by inner mem- 
brane enzymes with functions in signal transduction, 
protein translocation, electron transport and ATP 
synthesis (Supplementary Figure S4C). Moreover, 
certain TBDRs were found to be highly abundant in 
all investigated cell samples. 

Laminarin and alginate utilization loci 
PUL-like systems are frequently found in members 
of the Bacteroidetes and typically comprise outer 
membrane Sus-like lipoproteins, such as SusC-like 
TonB-dependent transporters and SusD-like 
substrate-binding proteins, which function in 
concert with linked CAZymes, as described for 
members of the anaerobic genus Bacteroides of the 
human intestine (Martens et al, 2009, 2011). The 
proteome analysis of this study demonstrated that in 
'G. forsetif two PULs are involved in alginate 
and laminarin degradation. These PULs comprise 
common components (TBDR, SusD, PKD domain 
proteins and CAZymes) and were found to be 
present in other marine Flavobacteriaceae as well 
(Figures 1 and 2). 

The laminarin-dependent PUL of 'G. forsetif 
encodes three 1,3-beta-glucanases, that is, two 
family-16 glycoside hydrolases (GH16 laminari- 
nases) and one GH3 (Supplementary Table S4). 
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On the basis of their localization at the cell surface 
and according to information in the CAZy database 
(Cantarel et al, 2009) both GH16 proteins 
(GFO_3466 and 3468) may constitute endo-acting 
enzymes that initiate polymer breakdown by 
randomly cleaving laminarin into smaller oligosac- 
charides. The latter can be taken up by a TonB- 
dependent transporter into the periplasmic space, 
where subsequent hydrolysis of terminal glucose 
residues by the presumably exo-acting GH3 
(GFO_3467) may take place (Figure 3). Both, the 
hypothetical outer membrane protein (GFO_3465, 
Supplementary Table S7) encoded in this PUL, and 
one of the two GH16 laminarinases (GFO_3466) 
feature a PKD domain, known to participate in near- 
surface protein-protein interactions (Jing et al., 
2002). This suggests the existence of a physical link 
between the proteins, possibly connecting the 
substrate-binding SusD and the glycoside 
hydrolases. 

The much larger alginate-dependent PUL encom- 
passes four operons (Figure 2, Supplementary 
Tables S5). Upstream of the typical activity unit 
(operon 3 with TBDR, SusD, PKD and CAZymes) 
three polysaccharide lyase (PL) genes are located 
(operon 1 and 2 with PL6, PL7 and PL17). Down- 
stream, an inner membrane transporter and genes 
for cytoplasmic processing of carbohydrates are 
encoded (operon 4 with genes for MFS permease, 
Sdr, KdgK and KdgA). On the basis of our expres- 
sion analysis and consistent with transcriptome 
studies on the Flavobacterium species Zobellia 
galactanivorans (Thomas et al., 2012), 'G. forsetiF 
gradually catabolizes alginate into 2-keto-3-deoxy-6- 
phosphogluconate (KDPG), which is likely directly 
funneled into the Entner-Doudoroff pathway (Preiss 
and Ashwell, 1962; Figure 3). A highly coordinated 
action of all alginate lyases (in operon 1, 2 and 3) 
and the cytoplasmic proteins 2-dehydro-3-deoxy-D- 
gluconate 6-dehydrogenase (Sdr), 2-dehydro- 
3-deoxygluconokinase (KdgK) and KDPG aldolase 
(KdgA; operon 4) is strongly supported by the 
proteome analysis, which revealed a consistent 
regulation pattern of these proteins within the 
alginate PUL (Supplementary Table S5). 

The PKD-domain-containing proteins present in 
both, the alginate (GFO_1155) and the laminarin 
cluster (GFO_3465), most likely perform different 
functions as is indicated by their low similarity 
(51.4%) and striking differences in molecular 
weight (34.0 kDa for the alginate- and 96.6 kDa for 
the laminarin-induced proteins). PKD-domain-con- 
taining proteins and alginate lyases or laminari- 
nases, respectively, were only upregulated in the 
two PULs described. 



Carbohydrate transporters and associated SusD-related 
proteins 

Efficient nutrient uptake is a critical step in 
the competition of marine bacterioplankton. 
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PUL organization and orthologous genes 



Gramella forsetii KT0803 

Zunongwangia profunda SM-A87 
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Figure 1 Expression profile (a) and gene organization of the laminarin utilization operon. (a) Protein ratios of the intracellular soluble 
and membrane protein fractions of cultures grown on laminarin compared with alginate are shown. Except for a signal transduction 
system (GFO_3462), all genes of the laminarin operon were overexpressed in the presence of laminarin. Consistency in protein levels is 
highlighted by the gray background that combines all data points of the corresponding substrate and indicates a uniform cluster 
regulation. Error bars refer to the s.d. of all biological replicates [n = 3), median-centered protein ratios are given in Supplementary Table 
S4. (b) The gene organization of the laminarin-specific PUL in 'G. forsetii' is conserved among other marine Flavobacteriaceae as 
exemplified. Identity/similarity values are listed in Supplementary Table S9. htcs, hybrid two-component system; susC, SusC-like TBDR; 
susD, SusD-family protein; pkd, PKD-domain-containing protein; gh, glycoside hydrolase. 



In heterotrophic Gram-negative bacteria, TBDRs/ 
transporters have a pivotal role in the uptake of 
complex organic matter such as from algal poly- 
saccharides. This is especially reflected by a high 
number of TBDR-encoding genes in the genomes of 
marine Flavobacteria, compared with monomer 
specialists within the class Alphaproteobacteria 
(Fernandez-Gomez et aL, 2013). The importance of 
TBDR in marine bacterioplankton is corroborated by 
in situ proteome studies of samples from the South 
Atlantic (in particular at coastal up welling zones; 
Morris et aL, 2010), the Antarctic Southern Ocean 
(Williams et aL, 2012) and the coastal North Sea 
(Teeling et aL, 2012) where TBDRs dominated the 
set of expressed transporters. These transporters 
consist of an outer membrane TBDR that is linked to 
the cytoplasmic membrane via a TonB-ExbBD 
complex. The latter allows usage of the proton 
motive force across the cytoplasmic membrane for 
transport through the outer membrane by the 
selective opening of an outer membrane pore 
(Krewulak and Vogel 2011). Owing to their sub- 
strate-binding function and high sequence diversity, 
the TBDR component is suggested to mediate 
substrate specificity (Schauer et aL, 2008; Tang 
et aL, 2012). 

Our proteome study identified 33 of the 40 
'G. forsetii' TBDRs (83%) in the membrane fraction, 



21 (53%) of which were differentially regulated 
under the three investigated substrate conditions 
(Figure 4). These findings suggest that some TBDRs 
are constitutively expressed and act as substrate 
sensors, whereas other TBDRs are dynamically 
upregulated only when these substrates become 
available. Laminarin exerted the most significant 
influence on TBDR expression (28% of TBDR 
genes responded to laminarin, 13% to alginate; 
Supplementary Figure S5). Among the laminarin- 
induced TBDRs we found (I) one encoded in the 
laminarin-specific PUL (GFO_3463), (II) one 
encoded next to a gene for a highly induced 
hypothetical inner membrane protein (Supplementary 
Table S7, GFO_2191) and (III) three TBDRs that 
were also identified in glucose-grown cultures 
(GFO_0361, 2138 and 1,4-alpha-glucanase asso- 
ciated; GFO_1029 next to two large outer membrane 
lipoproteins with adhesion domains), indicating 
regulatory overlaps between laminarin and its 
monomeric component glucose. 

Half of all TBDR genes (20 of 40) were located next 
to genes encoding outer membrane SusD-like pro- 
teins. These proteins are unique to Bacteroidetes 
(Thomas et aL, 2011), and have been suggested to be 
involved in oligosaccharide binding as well as to 
interact with TBDRs (Shipmann et aL, 2000; Cho 
and Salyers, 2001). In addition to the 15 known 
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Figure 2 Differential expression (a) and genomic structure (b) of the alginate utilization cluster, (a) Shown are protein ratios of the 
intracellular soluble and membrane protein fractions of cultures grown on alginate compared with laminarin. The gray background 
marks the clustering of protein patterns in corresponding cultures. According to the CAZy database, GFO_1150 and 1151 are fragments of 
a single PL17 enzyme, which probably hindered its protein identification. Error bars indicate the s.d. of the biological replicates [n = 3), 
protein ratio values can be found in Supplementary Table S9. (b) The four putative operons of the alginate-specific PUL are flanked by 
thermostable stem-loop structures of possible transcriptional terminators (T, terminator; changes in Gibbs free energy: delta G(T1) 
-6.9kcalmol \ delta G(T2) -20.0kcalmol \ delta G(T3) -21.3kcalmol \ delta G(T4) -8.1kcalmol 1 ). Homologous gene 
equivalents are present in related marine Flavobacteriaceae (identity/similarity values are listed in Supplementary Table S5). Dotted 
lines mark genes that belong to a separate operon, but were considered in this overview, because of their orthology to corresponding 
genes in 'G. forsetii'. susC, SusC-like TBDR; susD, SusD-family protein; pied, PKD-domain-containing protein; pi, polysaccharide lyase; 
tim, TIM-barrel-domain-containing protein; kdgF, pectin degradation protein; gntR, GntR family transcriptional regulator; mfs, MFS 
permease; sdr, short-chain dehydrogenase/reductase family protein with a putative function as 2-dehydro-3-deoxy-gluconate-6- 
dehydrogenase; kdgK, 2-dehydro-3-deoxy-gluconokinase; kdgA, KDPG/KHG aldolase; fbp, fructose-l,6-bisphosphatase; lad, LacI family 
transcriptional regulator; hyp, hypothetical protein. 
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SusD family genes (PF07980; Figure 4b 
upper section; Bauer et al., 2006), this study 
revealed five additional genes with novel SusD-like 
domains (PF12771 and PF12741; Figure 4b, lower 
section) in the genome of 'G. forsetii' . Fourteen 
(70%) of these in total 20 susD-related genes were 
expressed and exhibited similar expression patterns 
as their colocalized TBDRs, but in comparison 
with the TBDRs a lower fraction of the susD-like 
genes were regulated (only 25% of susD genes 
were regulated, mainly laminarin-dependent with 
glucose overlaps). Alginate- or laminarin-induced 
upregulation of TBDR genes occurred irrespective of 
whether these were colocated with susD genes 
(50%, Figure 4b) or not (45%, Figure 4a). 

Tandems of susC and susD genes in 'G. forsetii' are 
frequently associated with CAZyme genes (45%; 
Bauer et al., 2006). However, only few SusD-like 
proteins from GH-associated PULs were expressed 
under the tested conditions (44%, in contrast 
to 82% in non-GH-containing operons). Such GH- 
linked susD-like genes belonged to (I) the alginate 
and laminarin PUL (GFO_1154 and 3464) and were 
also associated with genes for (II) an alpha-amylase 
(GH13; SusD-like GFO_2139) and (III) a fructan- 
specific hydrolase (GH32; SusD-like GFO_0011). 
The GH32-associated susD-like gene belongs to an 



operon (GFO_0008-11, Supplementary Figure S6, 
Supplementary Table S9) that comprises a glucose- 
specific TBDR gene (GFO_0010). Surprisingly, the 
GHl3-associated susD-like gene is linked to an 
aforementioned laminarin-induced susC-like gene 
(GFO_2138), which is part of a 1,4-alpha-glucan 
utilizing PUL (GFO_2129-41; Figure 5, Supplementary 
Table S9). 

Among the group of inner membrane transporters, 
differential expression was detected for three genes 
of the major facilitator superfamily (MFS) transpor- 
ters (5 of 18 genes identified on protein level). On 
the basis of their annotation, the regulated second- 
ary carriers are specific for hexuronate/hexarate 
(GFO_1159, upregulated in alginate), alpha-gluco- 
sides and galactose/glucose (GFO_2136 and 3326, 
both exclusively detected in glucose cultures). In 
contrast, high numbers of ATP-binding cassette 
(ABC) transporters (20 of 28 genes) were found in 
the proteome of all examined substrates, 
but exhibited relatively constant, and therefore 
substrate-independent, expression levels. Like other 
marine Bacteroidetes, ABC transporters of 
'G. forsetii' almost entirely lack periplasmic 
solute-binding proteins that usually mediate 
substrate specificity (Bauer et al., 2006). This 
stands in stark contrast to the expression of such 
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Figure 3 Tentative model of the alginate (a) and laminarin (b) utilization pathways in 'G. forsetii'. Protein localizations were predicted 
in silico according to (Romine, 2011); Supplementary Table S2). (a) Four outer membrane alginate lyases of polysaccharide families 6 and 
7 (PL6 and PL7, endo-acting according to ExPASy) probably catalyze the depolymerization of alginate into hetero- or homo-oligomeric 
alpha-L-guluronic acid (GA) and beta-D-mannuronic acid (MA). The PKD-domain-containing protein (PKD) and a SusD-related 
oligosaccharide-binding protein (SusD) may function as bridges delivering the oligomeric substrate to the TBDR. Transfer through the 
outer membrane is mediated by a linkage of the TBDR to the energizing TonB-ExbBD complex. According to their suggested localization, 
periplasmic alginate lyases (putative oligoalginate lyase PL17 and guluronate/ mannuronate lyase PL7) might exhibit oligoalginate 
specificity. This allows for breakdown into 4-deoxy-5-keto-mannuronic or guluronic acid (KMA and KGA). The monomers can be 
transported into the cytoplasm by means of an MFS permease and step by step converted into 2-keto-3-deoxygluconate (KDG; by a 
putative 2-dehydro-3-deoxy-gluconate-6-dehydrogenase, Sdr), keto-deoxy-phospho-gluconate (KDPG; by a 2-dehydro-3-deoxy-glucono- 
kinase, KdgK) and pyruvate (Pyr) as well as glyceraldehyde-3-phosphate (GAP; by KDPG aldolase, KdgA). (b) The polysaccharide 
laminarin is metabolized into monomeric glucose by two outer membrane endo-l,3-beta-glucanases (GH16) and a periplasmic exo-1, 3- 
beta-glucanase (GH3). Nutrient-binding and outer membrane transport are carried out by the same protein families as present in the 
alginate utilization pathway (PKD, SusD and TBDR). Uptake of glucose into the cytoplasm is conducted by a constitutively expressed 
MFS permease (MFS) or ABC-type transporter (ABC), followed by glycolysis resulting in the generation of pyruvate. 



periplasmic-binding proteins in Alphaproteobac- 
teria, which was recently found in a metaproteomic 
study of Antarctic surface waters (Williams et aL, 
2012), and might indicate a housekeeping function 
of ABC transporters in 'G. forsetii'. 



CAZymes 

GTs participate in the biosynthesis of glycans and 
glycoconjugates that mediate various functions such 
as surface adhesion, biofilm formation, signaling or 
energy and carbon storage (Taniguchi et aL, 2002). 
Genome analyses of several marine Bacteroidetes 
(Bauer et aL, 2006; Fernandez-Gomez et aL, 2013) 
have shown high abundances of GT genes. Of the 60 
genes predicted for 'G. forsetii' (Bauer et aL, 2006), 
39 were proteomically identified in this study, with 
a dominance of the families GTl and GT2. Probably 
owing to their basic cellular function, expression of 
GTs was similar under all substrate conditions. 

Many members of glycoside hydrolase families 
exhibited non-differential expression or were not 
identified (Supplementary Table S8). Interestingly, 
however, two colocated genes, one encoding a 
GH13 (amylase/cyclomaltodextrinase, GFO_2133) 
and one encoding a GH65 (maltose phosphorylase, 
GFO_2134) were significantly upregulated in 
glucose- and laminarin-grown cultures. Both the 
genes affiliated to a large alpha-l,4-D-glucan PUL 
(Supplementary Figure 5, Supplementary Table S9) 
together with adjacent glycolytic genes. The maltose 



phosphorylase likely obtains its substrate maltose 
from the adjacent amylase/cyclomaltodextrinase 
that, owing to its dual function, may decompose 
both maltodextrins (1,4-linked D-glucose oligomers) 
and its precursors glycogen, starch and amylose 
(long-chained glucose polymers). Absence of two 
further cluster-encoded amylases (GFO_2132 and 
2141) on the protein level suggests focusing on 
dextrins as parent substrates. Two additional com- 
ponents of this cluster, an alpha-glucoside-specific 
MFS permease and a beta-phosphoglucomutase, 
were identified in a single replicate of glucose- 
grown cells that provide the cytoplasmic glycolysis 
machinery with glucose-6-phosphate. 'G. forsetii' 
notably regulated cell surface and intracellular 
utilization genes of the alpha- 1,4-D-glucan PUL in 
a differential manner (Figure 5). The outer mem- 
brane proteins TBDR and SusD were selectively 
activated in the presence of polymeric laminarin, 
whereas intracellular tri-/di-/monosaccharide pro- 
cessing enzymes and a transporter were strongly 
induced by monomeric glucose. 

Sensing of substrates 

The majority of hybrid sensor/response regulator 
signal transduction systems was identified in the 
membrane or intracellular fraction (9 of 12), some of 
which responded specifically to laminarin 
(GFO_1684, 1697 and 2345), but this response was 
insignificant in comparison with glucose-grown 
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Figure 4 Regulation patterns of (a) TBDRs and (b) TBDR-SusD PULs frequently clustered with CAZymes. Alginate- and laminarin- 
dependent relative quantitative changes in protein amounts of SusD-independent and SusD-associated TBDRs as enriched in the 
membrane fraction. Protein profiles illustrate similarities in regulation patterns of TBDRs and corresponding SusD proteins. 
Colocalization of CAZymes was almost exclusively determined for operons containing SusD-related proteins with domain PF07980. 
Except for those associated with the major alginate or laminarin-specific PUL (in bold), all CAZyme genes in these operons remained 
unexpressed (in italics). Locus tags of 'G. forsetii' (GFO) proteins are indicated. 



cells (Supplementary Table S9). Also, high numbers 
of sensors and regulators of conventional non- 
hybrid two-component systems were determined 
in the proteome (28 of 62 predicted signal transduc- 
tion elements, 45%). However, except for a few 
glucose-induced response regulators (GFO_1017, 
1420 and 1778), most of these proteins showed no 
differential expression. These findings, together 
with the omnipresence of various two-component 
signaling proteins under all substrate conditions, 
suggest a constitutive function, in which rather 
specificities than high quantities of signal transduc- 
tion systems mediate the distinct upregulation of 
polysaccharide-associated genes. We observed a 
limited expression of ECF-sigma/anti-sigma factors 
(6 of 21 genes; 29%), which indicates that sensing of 
alginate, laminarin and glucose was not predomi- 
nantly carried out via TonB-dependent transducers 
(TBDRs that are linked to a cytoplasmic sigma factor 
by inner membrane-localized anti-sigma factors; 
Koebnik, 2005). Instead, sensing was likely initiated 
by TBDR-enabled uptake of oligosaccharides into 
the periplasm in conjunction with (hybrid) two- 
component signal transduction to the cytoplasm. 
Only one sigma/anti-sigma system (GFO_1018-19) 
and an ECF-type sigma factor (GFO_3248) had an 



elevated expression in laminarin-supplied cultures, 
however, these genes were not found adjacent to a 
TBDR gene. 



Glycan-specificity and suggested functional 
assignments of proteins with unknown functions 
Of all identified proteins, 27% were annotated as 
hypothetical ones. Of these hypothetical proteins, 
70 displayed laminarin- or alginate-dependent 
expression (Supplementary Tables S6 and S7) and 
were predominantly detected in the membrane- 
bound fraction. Presence of the two polysaccharides 
resulted in increased expression levels of proteins 
that were likely involved in outer membrane 
assembly and maintenance of the cell envelope's 
integrity (alginate-dependent GFO_0835 and 1564; 
laminarin-dependent GFO_0244). Moreover, lami- 
narin induced a marked upregulation of an imely- 
sin-like protein with putative laminarin-binding 
function (GFO_2514) and its neighboring hypothe- 
tical protein (GFO_2515; Supplementary Figure S7). 
A variety of proteins with unknown functions also 
seemed to participate in TonB-dependent transport 
in laminarin-grown cultures (GFO_0462, 2190 and 
2728). In addition a TonB-like hypothetical protein 
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Figure 5 Proposed pathway for the decomposition of alpha-l,4-gmcans in 'G. forsetii' (a), organization of the cluster and its protein 
profile (b). (a) An alpha-l,4-linked glucose-polymer (glycogen, starch and amylose) is converted into oligomeric dextrin (by GH13 alpha- 
amylase, GFO_2132, GFO_2141 or GFO_2133), dimeric maltose (by dextrinase function of GFO_2133), glucose 1-phosphate (Glc-lP; by 
GH65 maltose phosphorylase, GFO_2134) and finally glucose-6-phosphate (Glc-6P; by PgmB, GFO_2135). The latter metabolite can be 
directly used by glycolysis and metabolized into pyruvate (Pyr). (b) Intracellular components contained in the operon located upstream 
were upregulated in both glucose- and laminarin-supplied cultures (f, highest protein quantity changes in glucose-grown cultures), 
whereas membrane proteins of the downstream located operon exhibited upregulation particularly in laminarin-grown cultures 
(:fP<0.01). Except for the two alpha-amylases (GFO_2132 and GFO_2141) and the LacI family regulator, all genes were found to be 
expressed in this proteome study. According to the predicted subcellular localization, log 2 ratio values are shown for the respective 
subcellular fraction of each identified protein (error bars: s.d.; protein ratios: see Supplementary Table S9). Locus tags are given as 
numbers below the genes, hyp, hypothetical protein; gapA, glyceraldehyde-3-phosphate dehydrogenase A; pfkA, 6-phosphofructokinase; 
gh, glycoside hydrolase family; pgmB, beta-phosphoglucomutase; mfs, MFS permease — possibly alpha-glucoside transporter; lad, LacI 
family transcriptional regulator; susC, SusC-like TBDR; susD, SusD-family protein; susE, putative SusE-like protein. 



(GFO_2900) showed increased expression in glucan- 
supplied cells. Its functional assignment is sup- 
ported by a similar membrane expression pattern 
detected for the associated TBDR component ExbD 
(GFO_2902). Furthermore, a laminarin-dependent 
putative SprE-like gliding protein (GFO_3274) and 
an alginate-dependent hydrolase (GFO_0138) were 
upregulated in polysaccharide-supplied cultures. 



Ecological implications 

'G. forsetii' KT0803 is able to specifically respond to 
the presence of the algal polysaccharides laminarin 
and alginate. Its glycan utilization strategy com- 
prises a distinct regulation of regulons with mostly 
cell surface proteins that allow for binding, uptake 
and degradation of complex sugars. The availability 
of either laminarin or alginate induced two PULs 
that share common characteristics (TBDR, SusD, 
PKD and CAZymes). Upregulation of the PKD- 
domain-containing proteins and laminarinases or 
alginate lyases, respectively, was restricted to these 
major PULs and thus highlights the importance of 
such clusters for polysaccharide decomposition. 
Moreover, the presence of homologs of PUL- 
encoded genes in related marine Flavobacteriaceae 
indicates a conserved glycan utilization process. 
To accommodate the varied composition and 
complexity of polysaccharides, PULs seem to be 



organized as matching modules, in which genes can 
be rearranged, modified, added or removed in order 
to allow an evolutionary adaptation to the break- 
down of specific substrates. Accordingly, utilization 
of the low-complexity substrate laminarin involves a 
smaller PUL compared with that of the more 
complex polysaccharide alginate. 

'G. forsetii' lives in coastal surface waters and 
occupies an ecological niche that is defined by 
polysaccharide resources from algae. A respective 
adaptation is evident from the high number of susCI 
susD-like gene pairs and CAZymes in 'G. forsetii' that 
notably exceed those found in the oligotrophic 
ocean-adapted Flavobacteria, Polaribacter spp. and 
Dokdonia spp. (Fernandez-Gomez et ah, 2013). One- 
third of all TBDRs in 'G. forsetii' were involved in the 
laminarin-specific response. In addition, partial 
co-regulation of TBDRs by both laminarin and its 
monomer glucose indicates that not only the glyco- 
side linkage type but also sugar composition controls 
PUL expression in Bacteroidetes. Likewise, crosstalk 
between the PULs specific for the beta-l,3-glucan 
laminarin and the alpha- 1,4-glucans in 'G. forsetii' 
demonstrates how the catabolism of two polysacchar- 
ides that are composed of the same monomer, but 
differ in linkage type can be efficiently co-regulated. 
A selected set of TBDRs and SusD-like proteins was 
found to be constitutively expressed under all tested 
substrate conditions. This suggests that TBDRs do not 
only act together with SusD-like proteins in binding 
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and uptake of specific substrates, but also have a 
general function in the early recognition of nutrient 
availability, probably via periplasmic signal trans- 
duction systems. Likewise, Thomas et al. (2012) 
suggested the presence of 'sentry' proteins waiting to 
recognize complex sugars and to trigger the alginate 
degradation pathway in the marine flavobacterium 
Zobellia galactanivorans. Moreover, a transcriptome 
study of the gut bacterium Bacteroides plebeius by 
Hehemann et al. (2012) showed a basal expression of 
selected CAZymes without an algal polysaccharide, 
but in the presence of its respective monomer. Such 
'sensing' mechanisms may ensure a rapid response 
when algal polymers become available. 

On overall, this proteogenomic study of the marine 
Bacteroidetes 'G. forsetii' allowed a detailed func- 
tional characterization of a series of enzymes, 
transporters and sensing proteins that concertedly 
participate in the consumption of typical algal 
polysaccharides. It can serve as a useful starting 
point in assessing substrate utilization capacities and 
metabolic versatilities of other Bacteroidetes repre- 
sentatives, such as members of the genera Polaribac- 
ter and Formosa, which have been identified as key 
organisms in the bacterioplankton during an algae 
bloom (Teeling et al., 2012), or related bacteria in the 
human intestine (Martens et al., 2011). Such func- 
tional analyses of model bacteria have the potential 
to establish protein biomarkers with well-character- 
ized specificities that will be valuable for the 
interpretation of future metaproteomic in situ studies 
and thereby will ultimately deepen our understand- 
ing of the concerted polysaccharide breakdown by 
complex natural microbial communities. 
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